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L7 ANSWER 1 OF 5 BIOSIS COPYRIGHT 2001 BIOSIS 
ACCESSION NUMBER: 1998:490986 BIOSIS 
DOCUMENT NUMBER: PREV 199800490986 
TITLE: Prediction of binding constants of 

protein ligands: A fast method for the 
prioritization of hits obtained from de novo design or 3D 
database search programs. 
AUTHORfSY Boehm, Hans- Joachim (1) , . . 

CORPORATE SOURCE: (1) Hoffmann-La Roche Ltd., Pharmaceuticals Division, 

Computational Chemistry, CH-4070 Basel Switzerland 
SOURCE- Journal of Computer- Aided Molecular Design, (July, 1998) 

Vol. 12, No. 4, pp. 309-323. 
ISSN: 0920-654X. 
DOCUMENT TYPE: Article 
LANGUAGE: English 

AB A dataset of 82 protein-ligand complexes of known 3D 
structure and binding constant Ki was analysed to elucidate the 
important factors that determine the strength of protein 
-ligand interactions. The following parameters were investigated, the 
number and geometry of hydrogen bonds and ionic interactions between the 
protein and the ligand, the size of the lipophilic contact 
surface, the flexibility of the ligand, the electrostatic potential in the 



* 



binding site, water molecules in the binding site, cavities along the 
protein-ligand interface and specific interactions between 
aromatic rings. Based on these parameters, a new empirical scoring 
function is presented that estimates the free energy of binding for a 
protein-ligand complex of known 3D structure. The 
function distinguishes between buried and solvent accessible hydrogen 
bonds. It tolerales deviations in the hydrogen bond geometry of up to 0.2 
ANG in the length and up to 30degree in the hydrogen bond angk . without 
penalizing the score. The new energy function reproduces the binding 
cons ants (ranging from 3 .7 X 10-2 M to I X 10-14 M, corresponding to 
binding Lergfesbetween -8 and -80 kJ/mol) of the dataset with a standard 
deviation of 7.3 kJ/mol corresponding to 1.3 orders of magnitude in 
bhrtng affinity. The function can be evaluated very fast and is therefore 
also suitable for the application in a 3D database search or de novo 
ligand design program such as LUDI. The physical significance of the 
individual contributions is discussed. 

L7 ANSWER 2 OF 5 BIOSIS COPYRIGHT 200 1 BIOSIS 

ACCESSION NUMBER: 1998:50503 BIOSIS 

DOCUMENT NUMBER: PREV199800050503 

TITLE: Identification of novel farnesyl protein 

transferase inhibitors using three- 
dimensional database searching methods. 

AUTHORfS) Kaminski, James J. (1); Rane, D. F .; Snow Mark E, Weber, 
Lois; Rothofsky, Marnie L.; Anderson, Samantha D.; Lin, 

CORPORATIVE: m^^^^^^^Z^ 

SOURCE: Journal of Medicinal Chemistry, (Dec. 2, 1997) Vol. 4U, ino. 

25, pp. 4103-4112. 
ISSN: 0022-2623. 
DOCUMENT TYPE: Article 

^Gen^Lofafhl-L 

correlates the biological activity of a series of farnesyl protein 
transferase (FPT) inhibitors, exemplified by the prototype 
l(4-pyridy^ 

b) P yridin-ll-ylidene)pipendine, Sch 44342, 1, with their chemical 
structure was accomplished using the three-dimensional 
quantitative structure-activity relationship (3D-QSAR) software 
program Catalyst. On the basis of the in vitro FPT inhibitory activity of 
a training set of compounds, a five-feature hypothesis containing four 
hydrophobic and one hydrogen bond acceptor region was generated. Using 
this hypothesis as a three-dimensional query to search our corporate 
database identified 718 compounds (hits). Determination of the 
in vitro FPT inhibitory activity using available compounds from this 
"hitlist" identified five compounds, representing three structurally novel 
classes, that exhibited in vitro FPT inhibitory activity, IC50 ltoreq 5 



muM From these three classes, a series of substituted 
rhydrobenzothiophenes was selected 

inhibitory activity relationship stud.es. The results from these stud.es 
is discussed. 
L7 ANSWER 3 OF 5 MEDLINE 
ACCESSION NUMBER: 1998020698 MEDLFNE 
DOCUMENT NUMBER. 98020698 PubMed ID. 9,77091 
TITLE The discovery, characterization and crystallograph.cally 

determined binding mode of an FMOC-conta.nmg 
inhibitor of HIV-1 protease. H . 
AUTHOR: J?^^ c > $^^1^ \ f 

CORPORAM^ 

California at San Francisco, 94143, U.S.A. 

SOURCE^ ^^^OORGAlNICANtrM^lCINAL CHEMISTRY, (1997 Jul) 5 (7) 
131 1-20 

Journal code: B38; 9413298. ISSN: 0968-0896. 
PT IB COUNTRY ENGLAND: United Kingdom 
PUB. COUN IK^ ^ (J0URNAL ARTICLE) 

LANGUAGE: English 
FILE SEGMENT: Priority Journals 
ENTRY MONTH: 199711 
ENTRY DATE: Entered STN: 19971224 
Last Updated on STN: 19971224 
Entered Medline: 19971110 
AB A pharmacophore derived from the structure 

derivative of haloperidol bound in the active site of the HIV-1 protease 

fn an extended conformation that results tn occupancy of the S2, SI , and 
ST subsites of the active site. Initial structure-activity 

with active site residues and is important for binding, (2) the N(O) tosyi 
™u i necessary to suppress pronation of the argimne guanidinyl 
feZus and 3 the arginine carboxamide function is involved in 

FMOC-protected arginine derivatives, which appear to be 
Specific and nontoxic, offer promise for the development of useful HIV-1 
protease inhibitors. 
L7 ANSWER 4 OF 5 SCISEARCH COPYRIGHT 2001 ISI (R) 



ACCESSION NUMBER: 1999:316486 SCISEARCH 
EDINBURGH^ ^ MIDLOTHIAN, SCOTLAND 

COUNTRY OF ^JW^^vaUX*. (,6 APR 1999) Vo.. 287, 

No. 

PubTishenAC^ADEMIC PRESS LTD, 24-28 OVAL RD, LONDON NWI 

7DX, ENGLAND. 

ISSN: 0022-2836. 
DOCUMENT TYPE: Article; Journal 
FILE SEGMENT: LIFE 
LANGUAGE: English 

MFE ™ A C B ° s XcT is' AVAILABLE IN THE ALL AND I ALL FORMATS* 

*ABSlKACi la AvniL-nu mnrif se( j t0 screen small 

AB The molecular docking computer program SANDOCK was used 
mnlpmle three-dimensional databases in the 

ta, fo nov^FKBP inhibitors. Spectroscopic measurements confirmed 
Sins of over 20 compounds to the target prot«.n, some with 
dSssoefation constants in" the tow micromolar range. The dtscovery that 
bindina protein is a steroid binding protein may 
Serological .gniftcance Tw^^^T^ * 
determine the steroid binding mode and confirmed the interactions 
predated by the docking program. (C) 1999 Acadermc Press. 
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ACCESSION NUMBER: 96:641597 SCISEARCH 

™ NUW %Tp C roxCdata-bank-current status and 

AUTHOR: FUT ^™°prin0; MANNING N O; PRILUSKY ,; STAMPF D 

m73 K^^^ B ra»-- 



1L .76100 REHOVOT, ISRAEL. BROOKHAVEN NATL LAB, DEPT 



BIOL, 

UPTON, NY, 11973 



Ur iuin, in i, n"-' 

STANDARDS^ (MAY/JlJN 1996) Vol 101, No. 3, pp. 

231-241. 

ISSN: 1044-677X. 
DOCUMENT TYPE: Article; Journal 
FILE SEGMENT: PHYS; ENGI 
LANGUAGE. ENGLISH 

REFERENCE^3UNT^^ 24 ^ ^ ^ IALL FORMATS* 

AB Protein Data Bank (PDB) is an archive of experimentally 
determined three-dimensional structures ot 

by 3DB, the Three-Dimensional Database ot 

^molecular Structures .that ^J^^ 

systems. 3DB will operate as a direct-deposmon arch.ve that w * 
arcent third-party supplied annotations. Conyersion of PDB to JUB wn 
SSSX Providing a high degree of compatibility w.h ex.stmg 



software 



L17 ANSWER 1 OF 50 BIOSIS COPYRIGHT 2001 BIOSIS 
ACCESSION NUMBER: 2001:307268 BIOSIS 

state ensemble: Implications for fold recognition. 

Galveston, TX, 77555-1055: yince@hbcg.utmb.edu USA 
SOURCE: Protein Science, (May, 2001) Vol. 10, No. 5, pp. 1032 1045. 

print. 

ISSN: 0961-8368. 
DOCUMENT TYPE: Article 
LANGUAGE. English 



contains all of the thermodynamic mformation neces^ry ■ to encode a 

f:rrraSS 

often aligned correctly with their correspond,^ . target p offles 
testing that calculated thermodynamic stab.lrty profiles have the 

of protems usrng DS^ndary tru n ^ ^rf, 

matrix, tnstead of CORbX residue staui j y x - d stability 

etembts be!t represented by a time-averaged structure. 

T 17 ANSWER 3 OF 50 BIOSIS COPYRIGHT 2001 BIOS1S 
ATCEisiONNUMBER: 1999:496274 BIOSIS 
SScSn NUMBER: PREVW9900496274 
?ITLE Analysis and assessment of ab m,t,o three- 

dimensional prediction, secondary structure, and 

SOURCE: ^ Proteins, (1999) Vol.0, No. SUPPL. 3, pp. 149-170. 

ISSN: 0887-3585. 
DOCUMENT TYPE. Article 
LANGUAGE: English 



data with 507 datasets for fifteen selected targets and sixty-one groups 
participating. As with CASP2, methods ranged from computationally 
fntensive strategies that attempt to recreate the physical and ^'«* 
forces involved in protein folding to the more recent knowledge-based 
approaches. These exploit information from the structure databases 
extracting potentially similar fragments and/or distance constraints 
d S o 8 m multiple'sequence alignments. The knowledge-based approaches 
generally gave more consistently successful predictions across the range 
of targets, particularly that of the Baker group (Bystroff and Baker J 
MolBiol 1998;281:565-577; Simons et al. Proteins Suppl 1999,3 J 7 1-1 76), 
which used a fragment library. In the secondary structure P^c-n 
category, the most successful approaches built on the concepts used in PHD 
(Rost et al Comput Appl Biosci 1994; 10:53-60), an accepted standard 
nthis field. Like PHD, they exploit neural networks but have different 
st ategies for incorporating multiple sequence data or Posmon-dependent 
Light matrices for training the networks. Analysis of the contact data, 
for which only six groups participated, suggested that as yet this data 
provides a rather weak signal. However, in combination ^with other types of 
prediction data it can sometimes be a useful constraint for identifying 
the correct structure. 

L17 ANSWER 4 OF 50 BIOSIS COPYRIGHT 200 1 BIO SIS 
ACCESSION NUMBER: 1998.400280 BIOSIS 
DOCUMENT NUMBER: PREV 199800400280 
TITLE: Prediction of local structure in proteins using a library 

of sequence-structure motifs. 

™1?esour^^^ 

SOURCE: Journal of Molecular Biology, (Aug. 21, 1998) Vol. 281, No. 

3, pp. 565-577. 

ISSN: 0022-2836. 
DOCUMENT TYPE: Article 
LANGUAGE: English 

AB We describe a new method for local protein structure 
prediction based on a library of short sequence pattern that 
correlate strongly with protein three-dimensional 
structural elements. The library was generated using an automated method 
for finding correlations between protein sequence and local structure, and 
contains most previously described local f!^^^^™ 
as well as new relationships, including a diverging type-II beta-turn, a 
frayed helix, and a proline-terminated helix. The query sequence is 
scanned for segments 7 to 19 residues in length that strongly match one of 
the 82 patterns in the library. Matching segments are assigned the 
three-dimensional structure characteristic of the tUontirt > 
corresponding sequence pattern, and backbone torsion angles for the entire 



query sequence are then predicted by piecing together mutually compatible 
segment predictions. In predictions of local structure in a test set ot 55 
proteins about 50% of all residues, and 76% of residues covered by 
high-confidence predictions, were found in eight-residue segments within 
1 4 ANG of their-ta.e structures. The predictions are complementary to 
traditional secondary structure predictions because they are considerably 
more specific in turn regions, and may contribute to ab initio tertiary 
structure prediction and fold recognition. 

L17 ANSWER 5 OF 50 BIOSIS COPYRIGHT 2001 BIOSIS 
ACCESSION NUMBER. 1998:130372 BIOSIS 
DOCUMENT NUMBER: PREV 199800 130372 
TITLE An integrated sequence-structure database 

incorporating matching mRNA sequence, amino acid sequence 

and protein three-dimensional structure 

AUTHOR(S) ata ' Adzhubei, Ivan A.; Adzhubei, Alexei A. (1); Neidle Stephen 
CORPORATE SOURCE: (1) CRC Biomolecular Structure Unit, Inst. Cancer Res., 

Sutton, Surrey SM2 5NG UK 
SOURCE: Nucleic Acids Research, (Jan. 1, 1998) Vol. 26, No. 1, pp. 

327-331. 

ISSN: 0305-1048. 
DOCUMENT TYPE. Article 
LANGUAGE: English 

AB We have constructed a non-homologous database, termed the 
Integrated Sequence-Structure Database (ISSD) which comprises 
the Coding sequences of genes, amino acid sequences of the corresponding 
proteins, their secondary structure and variant phi,psi angles _ 
assignments, and polypeptide backbone coordinates. Each protein entry in 
the database holds the alignment of nucleotide sequence, amino 
acid sequence and the PDB three-dimensional structure 
data The nucleotide and amino acid sequences for each entry are selected 
on the basis of exact matches of the source organism and cell environment. 
The current version 1 .0 of ISSD is available on the WWW at 
http //www protein.bio.msu.su/issd/ and includes 107 non-homologous 
mammalian proteins, of which 80 are human proteins. The database 
has been used by us for the analysis of synonymous codon usage patterns in 
mRNA sequences showing their correlation with the three- 
dimensional structure features in the encoded proteins. Possible 
ISSD applications include optimization of protein expression, improvement 
of the protein structure prediction 

accuracy, and analysis of evolutionary aspects of the nucleotide 
sequence-protein structure relationship. 

L17 ANSWER 6 OF 50 BIOSIS COPYRIGHT 2001 B1OSIS 
ACCESSION NUMBER: 1997:438129 BIOSIS 



CORPORATE SOURCE: European Heide , berg Germany 

SOURCE ""'Sffi^ * - 34 " 56 

DOCUMENT TYPE: Article 
I ANGUAGE English 

AB The problem of predict! ng P™^™^^ despite more than three 
from the sequence new and promising methods 

decades of intensive ««*J^ * prediction have 
in three-din.ens.onal (3D), 2U ana it V (he te ,„ 

reopened the field. ^^1°"^. and incorrect models 
databases -f^^Sd by analysis of 
(3D). Inter-residue contacts (W) can Secondary strncture, 

correlated mutations, albeit j ™" ^ dicted with 

solvent accessibility and ^^^rjequence alignments. Some 
significantly ^'SSi i and reliable enough 
of these new prediction met hods ^ p . ta , struc ture 
t0 be useful in SJ^^^X^^-o^fal — is 

L17 ANSWER 8 OF 50 BIO^ COPYRTGHT 2001 BfOSXS 
ACCESSION NUMBER. 1995:106286 BlOSlb 

T sequence-structure compatibility method. 

565 Japan 3 No 11, pp. 2055-2063. 

SOURCE: Protein Science, (1994) Vol. 

ISSN: 0961-8368. 
DOCUMENT TYPE. Article 
t amquaGE English 

AB A method for V-^^^^y of an amino acid 

identifies the most likely *«rtw* The ^structures of the 

number of sequences in a drtta e, a d *e * ^ ^ ^ ^ 

following prote.ns were P«*^ ' (IIAB-Man), (3) rat tyrosine 

aminotransferase (Tyr AT), an IK It dictions are discussed, 

functional and «"^X'J^ kinase was 



^— 

UAB-Man and galactose/glucose-b nd ng p £ domains of 

HA and UB domains are aligned with the N ana ^ of 

IlABx-Man are threaded o^^^ phospho ryl transfer from His 10 
of GGBP. The prediction accoun for £he M> P ^ 
to His 175, and to the sugar J^j^ te AT was predicted, as 

between rat Tyr AT and ^^^^^^^ 
well as (4) the structural sum anty between ^ § 

^^^^^^^ 

the proteins 

profites derived from ,hr«e-dimens,onal 

AUTHORfSr^Ou-nis, Chnstos; Sander, Chris; 

Schneider, Reinhard D . 690o Heidelberg Germany 



SOURCE. 

805-825. 
ISSN: 0022-2836 
DOCUMENT TYPE. Article 



JJU^U1VUJ-1> x ^ 

is formulated here as that of evalua ting com p U cated 
fits a hypothetical structure. The simp es^a ^ ^ 

approaches, secondary fitness problems. Here, 

calculations, can be viewed as sequence tru ^ (1) 

an approach of intermediate complexi ty * , 
description of a protem ^^^° cont act. (3) generation of 
with both intra-protem and P™*^ b lacing the input sequence m 
numerous hypothetical model a t™ c tu«*s in 

a large set of known ^^^J^ m odels by summing the 
all possible alignments, W^^ 1 d dent core weights derived 
sequence preferences ove all struct ura p Qf the method are 



in a database of sequence, T e m ^ ^ ^ Qf 

sequence can very well find nat v such as 

alternatives, in correct alignment low sequen ce 

(beta-alpha)-n untts, can be d elected in p. t ^ J dependence on 

similarity; (3) remote homo ogue e detect , ^ ^ 

the set of parameters -^^^p™"** « * sim .P le „ 
superior to classical secondary si ruci * P^ protein-protein and 
interface description m terms w P ell; {6) the use of core 

protein-water contacts, P erforra ^'P™'S tec tion of remote homologues, 
weights considerably improves a«»™V ithTmyoglobin contact 

DOCUMENT ~ an S men , methods applied to protein 

CORPORATE SOmCE jO^ ^ ^ ^ ^ Montpelher 

SOURCE FranCe Biochim i e(Paris),( 1 993)Vol.75,No.5,PP.3»-36,. 
SOURCE. ISSN:0300 . 9084 
DOCUMENT TYPE: Article 

LANGUAGE: English tructura l knowledge specific to a 

AB Learning techniques are abk ^ o ex ract s ^ 

selected set of proteins. We tec * two g ^ a , oca , fold . 

expressing the propensity of » j^^™^ prediction rules based 
The first algorithm generates "^£™£d L„d in the learning 
„„ a dictionary of geomet L indicate the 
database. The second algonthm leads to s environme nt. 
fit between an amino acid and a given , loca , information profiles 

Dynamic programming is then used Jo a hgn «™« &nction! , The 

by modifying the local «^££o*. structural prediction 

L17 ANSWER 12 OF 50 BIOS1S COPYRIGHT 2001 B10SIS 



ACCESSION NUMBER. ^132440 BIOSIS 

TTTLE^^^ ^dI^ABASE OF HOMOLOGY-DER.IVED PROTEIN 

STRUCTURES^ STRUCTURAL MEAN1NG OF SEQUENCE ALIGNMENT. 

:: RC , 

S0U CODEN: PSFGEY. ISSN. 0887-3585. 

FILE SEGMENT: BA; OLD 
LANGUAGE. English 

AB The database of known prote » ed fe the use 

dimensional ^^ctures can be sign ficarty ations . (1) The 

predicting protein ^^Xmotog, can be inferred from the 
building by homology. (3 S ^'?™ t ^ eshol f 0 f sequence similarity 
level of sequence similarity. (4) The thresho a t 
sufficient for structural J°^£52S£bS£«. sequence 
alignment. Here, we Erst quantify the elauon exhaus ,ive 
similarity, structure «™^^^Ci^Sr-i« and report a 
survey of alignments b ^"/™ n X of alignment length. We then 
homology threshold curve as a function^ °^ 8 of 
produce a database of J^»'°^™ tt 3 kno wn structure all 
proteins (HSSP) by aligning to each £ threshold curw . For each 

matches in sequence diabase sea * * P s , ructural role 

detail by homology. 

L17 ANSWER 17 OF 50 EMBASE COPY^GHT 2001 ELSEVIER SCI B.V. 
ACCKSION NUMBER: 93207592 EMBASE 
DOCUMENT NUMBER: 1993207592 
TITLE Inverted protein structure 

prediction 



AUTHOR. Bowie J.U.; Eisen ^ e 'f f Institute Univ 0 f California at Los 

SOURCE: ""'^Opin^inan-^B^^^)" 

f S S 7 N 4 0 4 9 59-440X CODEN: COSBEF 
milNTRY United Kingdom 

™« ortSB^andMedi. 
Instrumentation 
029 Clinical Biochemistry 
LANGUAGE. English 

SUMMARY LANGUAGE. English ^ ^ bg dassified 

of known structures provrdes nu* , 
similar folds, with some ,. each oldmS d ss *J „ that share 

inverted protein structure predict ion ^to Here 
determine whether an >«T^^^ tnK t m prediction The 

we review the recent, ™P»^ ™° tha t, instead of looking for 

directly to three-dimensional folds. 
THE GENUINE ARTICLE: 403PK 

TITLE Protein structure prediction zX . HonigB 

"atesoi^^^ 

Mo1 ■ u «n w 1 68th St New York, NY 10032 USA (Repn"* 

ffiSS^ 'a^hes Med ,„st, Dep. Biochem & Mo. 

Biophys, New York, NY 10032 USA 

So" ° F ^SnTOHNION IN CHEMICAL BIOLOGY, (FEB 2001) Vol. 

Publisher CURRENT BIOLOGY LTD, 84 THEOBALDS RD, LONDON 

WC1X8RR, ENGLAND. 
ISSN: 1367-5931. 



DOCUMENT TYPE: General Rev.ew; Journal 
LANGUAGE: English 

REFERENCE COUNT: 52 ^ , ALL FORMATS 

*ABSTRACT IS AVAILABLE IN ^ and 

AB The pred.ct.on of protein struct ■ activlty . 

structure homology, has be mfe y P of applicab ,l,t y 

Homology models "ave become more ac cu ate a ^ ^ 

has increased ^'^Z^S'o ^ pas. few years, and also 

^ searches, *e - - m0deU " g 

conformations. There have ^T^X corresponding use 

^^^^^^"^^ 

incorrectly folded protein conformauons. 

„, ™: «n SEARCH COPYRIGHT 2001 IS1(R) 
S 200 S ™. SCISEARCH 
TTTl^E^^ 111 ^ ProteuUhreadtng^ising PROSPECT . Design and evaluation 

BIOSCISEC "660 COMMERCE PKDR.OAKR.DGE.TN37830 (Reprint) 
COUNTRY OF ™R ei USA trbctore ^ GENET1CS , (,5 

AUG2000) Vol.40, NoAPP y 343-354. EY & S0NS INC, 605 

^A*» NY ,0.58-00,2. 
ISSN: 0887-3585. 
DOCUMENT TYPE. Article; Journal 
FILE SEGMENT. LIFE 
LANGUAGE: English 

REFERENCE COUNT. 27 AND I ALL FORMATS* 

* ABSTRACT ISAVi^^ foM usmg the 

AB The computer syste m .^OSreCT to th p ^ & g . ven 

threading method is described and PROSPECT guarantees to 

target protein sequence and a t emplate »tru«u^ ^ scoring 

fJaglobW^ 



close (7 Angstrom or less between the l oe target _ t emplate 
implementation). ?n * te* * ^JSmily and having sequence 
proteins, each pan being from jthe ^upe > ' ^ % ^ ^ 
identity less than or equa to f ^ ? f { ^ ctn J y align able residues 
templates correctly and aligns 66/0 ^^ ^"^^^ 50/° fold recognition and 
correctly- These numbers ™^^J^ ly scoring terms 1), 
64% alignment accuracy for the same test set ^ b j, 
i 0 , and (iv), indicating the ^^^^^thcr improved to 
term, The fold recognition ^^™^^ truet u« information 
72% and 74%, respectively, when the ^ 0 ™ a St pROSPECT also allows a 
predicted by the PHD program is disulfide 
Ssertoincorpor^ 

bonds, active sites, and NOE dj stance^ ^ > j threading under 

process. The system rigorously find 8™**; \ he constra ints 

the specified constraints. Test results have , shown mat t 2000;40;3 43-354. 

can further improve the performance of PROSPECI, 

(C) 2000 Wiley-Liss, Inc. 
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"ACCUSER 2000.469365 SCISEARCH 

THE GENUINE ARTICLE. 325GC 

TITLE Protein structure prediction 

in the postgenomic era 

MTDDX, . 

ENGLAND (Reprint) 

SOURCE Y ° F AU CUKMNT^PIWON IN STRUCTURAL BIOLOGY, (JUN 2000) 
Vol. 10, 

Publisher; CIJRRENT BIOLOGY LTD, 84 THEOBALDS RD, LONDON 

WC1X8RR, ENGLAND. 

ISSN: 0959-440X. 
DOCUMENT TYPE: Article; Journal 
FILE SEGMENT. LIFE 
LANGUAGE. English 

REFERENCE COUNT: 55 AND I ALL FORMATS* 

•ABSTRACT IS AVAILABLE IN 1H* ^ the 
AB As the number of completely ever more 

postgenomic ^^^^P^M by genes of 



of these proteins. There are limitations to this approach to ^gene 
InSon and a survey of the expected rehab.hty of Afferent 
protein structure prediction techniques has 
been undertaken. 

L17 ANSWER 25 OF 50 SC1SEARCH COPYRIGHT 200 1 1SI (R) 
A^cS 1999:954399 SCISEARCH 

THE GENUINE ARTICLE: 262VB 
TITLE Predicting protein three-dimensional 

structure 

corporate ' Kv Maryland, Maryland blotechnol dot. 

CTR ADV KS oTEcmoi ?6oo 0UDELSKY DR ROCKVILLE, MD 20850 
(Reprint) 

COUNTRY OF A^ N f 0 A pM0N IN BIO TECHNOLOGY, (DEC ,999, Vol. !0. 
No. 

Publisher "CURRENT BIOLOGY LTD, 34-42 CLEVELAND STREET, 

LONDON W1P 6LE, ENGLAND. 

ISSN. 0958-1669. 
DOCUMENT TYPE. General Review; Journal 
FILE SEGMENT: LIFE 
LANGUAGE: English 

«™"f SScT S AVAHABLE n. TUB «!•»• 

There is still a long way to go before the general ab 



MACROMOLEC^BIOL, AVE cp ^ ^ ggLGlUM (Reprint) 

SOURCE^ ° F A ^RO^ElN ENGINEERING, (SEP .995) Vol. 8. No. 9, pp. 849-8: 

ISSN: 0269-2139. 
DOCUMENT TYPE: General Review; Journal 
FILE SEGMENT. LIFE 
LANGUAGE: ENGLISH 

mENC f^l CT £ AVAILABLE IN THE ALL AND TALL FORMATS* 
AB Database-derived potentials, compiled from frequences of 

this taction does no. depend on the conformation, tep^> « 
considered to be valid for scoring the compatibility of different 

corrected for the potentials approximate a physically meaningful tree 

*e conXed state, the free energy difference reduces to the , commonly 
used scoring schema or contains additional terms tha depend on the 
«*mence In both cases, all the terms ran be derived from 
sZence'i" »c ture frequencies in the database. Such free energy 
dlffe enrcommonly defined as the folding free energy, is a measure of 
l™t HabilUy and can be used for scoring both forward and inverted 
p™,ei» folding The implications for the use of know.edge-based 
potentials in protein structure prediction „ mllld 

are described Finally, the difficulty of designing tests that could 
validate the proposed approach, and the inherent limitations of such 
tests, are discussed. 

L17 ANSWER 44 OF 50 SCISEARCH COPYRIGHT 2001 ISI (R) 

ACCESSION NUMBER: 93:544573 SCISEARCH 

TrTLE^^^^ OPTIMAL NEURAL NETWORKS FOR PROTEIN- 
STRUCTURE PREDICTION 



(Reprint); AT&T ^ ^ MURRAY HILL, NJ. 07974 

SOURCE:' 1 ' ° F A ™CAL S 4v lE W E, (AUG ,993) Vol 48, No. 2, pp. 

1502-1515. 

ISSN: 1063-65 IX. 
DOCUMENT TYPE: Article; Journal 
FILE SEGMENT. PHYS 
LANGUAGE. ENGLISH 

^ FER ENCE^OUNT c 32 ^ ^ ^ ^ L TS * 

AB The suS application of neural-network algorithms for predion 
rfoS^SS.™^ stymied by three problem areas: the sparsity of 
iM^ok^n protein structures, poorly devised ne^ork 

trah to provide network algorithms that genuinely impact on the ability 

so that transfer of neural-network technology to more realistic 
three-dimensional proteins is evident 
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AB A review and discussion with 69 refs. In the context of protein 
structure prediction, there are 2 principle reasons for 
comparing and aligning protein sequences: (l)jo obtain an accurate 
alignment which may be for protein modeling by comparison to proteins of 
vknown 3-dimensional structure and (2) to scan a 
database with a newly detd. protein sequence and identify possible 
functions for the protein by analogy with well-characterized proteins In 
this chapter, I review the underlying principles and techniques tor 
sequence comparison as applied to proteins and used to satisfy these 2 
aims. 
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AB The 3D-1D compatibility method is a new approach to protein 
structure prediction. It evaluates the compatibility of 
a one-dimensional (ID) amino acid sequence with known three- 
dimensional (3D) structures, and select the most likely structure^ 
wThave developed a method, which evaluates the 3D-1D compatibility usmg 
the following functions, side-chain packing, solvation, hydrogen-bonding 
and local conformation Amotions. The method has been applied to a large 
no of sequences in databases. Here, the predictions of the 
structural similarities between the following pairs are described in _ 
detail: spermidine/putrescine-binding protein and maltose-binding pro em, 
shikimate kinase and adenylate kinase, and mannose permease hydrophihc 
subunit (IIABMan) and galactose/glucose-binding protein. 
evolutionary implications of the predictions are discussed^ Through these 
examples of predictions, the present work demonstrates the promise of the 
3D- ID method. 



